BOREL HIERARCHY AND OMEGA 
CONTEXT FREE LANGUAGES 



Olivier Finkel * 

Equipe de Logique Mathematique 
CNRS et Universite Paris 7, U.F.R. de Mathematiques 
2 Place Jussieu 75251 Paris cedex 05, France. 



Abstract 

We give in this paper additional answers to questions of Lescow and Thomas [Logi- 
cal Specifications of Infinite Computations, In:" A Decade of Concurrency", Springer 
LNCS 803 (1994), 583-621], proving topological properties of omega context free lan- 
guages (w-CFL) which extend those of [O. Finkel, Topological Properties of Omega 
Context Free Languages, Theoretical Computer Science, Vol. 262 (1-2), 2001, p. 
669-697]: there exist some cj-CFL which are non Borel sets and one cannot decide 
whether an w-CFL is a Borel set. We give also an answer to a question of Niwin- 
ski [Problem on cj-Powers Posed in the Proceedings of the 1990 Workshop "Logics 
and Recognizable Sets"] and of Simonnet [Automates et Theorie Descriptive, Ph.D. 
Thesis, Universite Paris 7, March 1992] about a;-powers of finitary languages, giving 
an example of a finitary context free language L such that L'^ is not a Borel set. 
Then we prove some recursive analogues to preceding properties: in particular one 
cannot decide whether an w-CFL is an arithmetical set. Finally we extend some 
results to context free sets of infinite trees. 
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1 Introduction 



Since Biichi studied the oj-languages recognized by finite automata to prove 
the decidability of the monadic second order theory of one successor over 
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the integers [BiicGOa] the so called a;-regular languages have been intensively 
studied. See [Tho90] and [PPOl] for many results and references. 

As pushdown automata are a natural extension of finite automata, Cohen and 
Gold [CG77] , [CG78] and Linna [Lin76] studied the a;-languages accepted 

by omega pushdown automata, considering various acceptance conditions for 
omega words. It turned out that the omega languages accepted by omega 
pushdown automata were also those generated by context free grammars where 
infinite derivations are considered , also studied by Nivat [Niv77], [Niv78] 
and Boasson and Nivat [BN80]. These languages were then called the omega 
context free languages (cj-CFL). See also Staiger's paper [Sta97a] for a survey 
of general theory of w-languages. 

Topological properties of a;-regular languages were first studied by Landweber 
in [Lan69] where he showed that these languages are boolean combinations of 
Gs sets. He also characterized the cj-regular languages in each of the Borel 
classes F, G, Fo-, G^, and showed that one can decide, for an effectively given 
a;- regular language L, whether L is in the Borel class F, G, F^-, or Gs- 
It turned out that an a;-regular language is in the class G^ iff it is accepted by 
a deterministic Biichi automaton. These results were extended to deterministic 
cu-CFL by Linna [Lin77]. In the non deterministic case, Cohen and Gold proved 
in [CG78] that one cannot decide whether an u-CFL is in the class F, G or 
Gs. 

We have begun a similar study for u-CFL in [FinOla]. we proved that u-CFL 
exhaust the finite ranks of the Borel hierarchy and that, for any Borel class 
S° or n°, n being an integer, one cannot decide whether an a;-CFL is in 
or n°. Our proof used the Wadge game and the operation of exponentiation 
of sets defined by Duparc [DupOl]. 

We pursue this study in this paper. We first show that there exist some u- 
CFL which are analytic but non Borel sets. Then we extend the preceding 
undecidability result to every Borel class (of finite or infinite rank) and we 
prove that one cannot even decide whether an w-CFL is a Borel set. 

The question of the topological complexity of the cu-power of a finitary lan- 
guage is mentioned in [Sta97a] [Sta97b]. Niwinski asked in [Niw90] for an 
example of a (finitary) language L such that is not a Borel set. Simonnet 

asked in [Sim92] for the topological complexity of L'^ where L is a context 
free language. We proved in [FinOla] that there exist context free languages 
Ln such that (Ln)"^ is a 11° -complete set for each integer n > 1. 

We give here an example of a context free language L such that L'^ is an 
analytic but not Borel set, answering to questions of Niwinski and Simonnet. 

Then we derive some new arithmetical properties of omega context free lan- 
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guagcs from the preceding topological properties. We prove that one cannot 
decide whether an u-CFL is an arithmetical set in |Jj>i S„. Then we show that 
one cannot decide whether the complement of an u-CFL is accepted by a (non 
deterministic) Turing machine (or more generally by a non deterministic X- 
automaton as defined in [EH93]) with Biichi (respectively MuUer) acceptance 
condition. The above results give additional answers to questions of Thomas 
and Lescow [LT94]. 

Finally we extend some undecidability results to context free sets of infinite 
trees, as defined by Saoudi [Sao92]. 

The paper is organized as follows. In sections 2 and 3, we first review some 
above definitions and results about w-regular, cj-context free languages, and 
topology. Then in section 4 we prove our main topological results from which 
we deduce in section 5 the result about a;-powers and in section 6 arithmetical 
properties of o'-CFL. Section 7 deals with context free languages of infinite 
trees. 



2 u-regulai and a;-context free languages 

We assume the reader to be familiar with the theory of formal languages and of 

u;- regular languages, see for example [HU69] ,[Tho90]. We first recall some of 

the definitions and results concerning a;-regular and w-context free languages 

and omega pushdown automata as presented in [Tho90] [CG77] , [CG78]. 

When S is a finite alphabet, a finite string (word) over S is any sequence 

X = xi . . . Xk , where G S for i = 1, . . . , /c ,and k is an integer > 1. The 

length of x is fc, denoted by |a:| . 

we write x{i) — Xi and x[i\ — x{l) . . .x{i) for i < k. 

If |a:| = , a; is the empty word denoted by A. 

S* is the set of finite words over E. 

The first infinite ordinal is w. 

An a;- word over E is an a; -sequence ai . . . a„ . . ., where e E, Vi > 1. 
When a is an w-word over E, we write a — a{l)a{2) . . . cr{n) . . . 
a[n] = a{l)a{2) . . . a{n) is the finite word of length n, prefix of a. 
The set of w-words over the alphabet S is denoted by E'^. 
An w-language over an alphabet E is a subset of E"^. 

The usual concatenation product of two finite words u and v is denoted u.v 
(and sometimes just uv). This product is extended to the product of a finite 
word u and an w-word v: the infinite word u.v is then the w-word such that: 
{u.v){k) = u{k) if k < \u\ , and 
{u.v){k) = v{k — \u\) ii k > \u\. 
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For V C J:* , V ^ {a ^ ui .. .Un ... e \ Ui e V,yi > 1} is the 6^-power of 
V. 

For V C E*, the complement of V (in E*) is E* — denoted V~ . 
For a subset A C E'^, the complement of A is E'^ — A denoted A~. 

The prefix relation is denoted the finite word w is a prefix of the finite 
word V (denoted uC.v)ii and only if there exists a (finite) word w such that 

V = u.w. 

This definition is extended to finite words which are prefixes of w-words: 
the finite word u is a. prefix of the w-word v (denoted u C. v) iS there exists 
an o'-word w such that v — u.w. 

Definition 2.1 A finite state machine (FSM) is a quadruple M = {K, E, S, Qq), 
where K is a finite set of states. E is a finite input alphabet, Qq & K is the 
initial state and S is a mapping from x E into 2^ . A FSM is called deter- 
ministic (DFSM) iff : 5 : K xE^ K. 

A Biichi automaton (BA) is a 5-tuple M — {K,Yl,S,qo,F) where M' — 
[K, E, 5, go) is a finite state machine and F C K is the set of final states. 
A Muller automaton (MA) is a 5-tuple M = [K^H^S^qo^F) where M' — 
{K, E, 5, go) is a FSM and F C 2^ is the collection of designated state sets. 
A Biichi or Muller automaton is said deterministic if the associated FSM is 
deterministic. 

Let a — 0102 . . . a„ . . . be an oo-word over E. 

A sequence of states r = gig2...g„... is called an (infinite) run of M = 
(A', E,5, go) on a, starting in state p, iff: 1) qi = p and 2) for each i > 1, 
qi+i e S{qi,ai). 

In case a run r of M on a starts in state qo, we call it simply "a run of M on 
a " . 

For every (infinite) run r = gig2 ■ ■ - Qn ■ ■ ■ of M , In{r) is the set of states in 
K entered by M infinitely many times during run r: 
In{r) = {q & K \ {i > 1 \ qi = q} is infinite }. 

For M = (K, E, 5, go, F) a BA , the u-language accepted by M is L{M) — 
{a G E'^ I there exists a run r of M on a such that In{r) fl F 7^ 0}. 
For M = (if, E, 5, go, F) a MA, the u-language accepted by M is L[M) — 
{cr e E'^ I there exists a run r of M on a such that In{r) e F}. 

The classical result of R. Mc Naughton [MaN66] established that the expressive 
power of deterministic MA (DMA) is equal to the expressive power of non 
deterministic MA (NDMA) which is also equal to the expressive power of non 
deterministic BA (NDBA) . 

There is also a characterization of languages accepted by MA by means of the 
"w-Kleene closure" of which we give now the definition: 

Definition 2.2 For any family L of finitary languages over the alphabet E, 
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the uj-Kleene closure of L, is : 

CO - KC{L) = {U'^.Ui.V;' I Ui,Vi eL,yie [l,n]} 

Theorem 2.3 For any co-language L, the following conditions are equiva- 
lent: 

(1) L belongs to uj — KC{REG) , where REG is the class of (finitary) regular 
languages. 

(2) There exists a DMA that accepts L. 

(3) There exists a MA that accepts L. 

(4) There exists a BA that accepts L. 

An tu-language L satisfying one of the conditions of the above Theorem is 
called an uj-regular language. The class of u-regular languages will be denoted 
by REG^. 

We now define pushdown machines and the class of a;-context free languages. 

Definition 2.4 A pushdown machine (PDM) is a 6-tuple M = (K, S, F, S, Qq, Zq), 
where K is a finite set of states, T, is a finite input alphabet, T is a finite push- 
down alphabet, qo & K is the initial state, Zq &r is the start symbol, and 5 is 
a mapping from if x (E U {A}) x F to finite subsets of K x F* . 
If ^ E describes the pushdown store content, the leftmost symbol will be 
assumed to be on " top" of the store. A configuration of a PDM is a pair {q, 7) 
where q E K and 7 G F*. 

For a e E U {A}, /3, 7 e F* and Z eT, if {p, (3) is in S{q, a, Z), then we write 
a : (q, Z-f) H^m {p,/3i)- 

H>^^ is the transitive and reflexive closure of (The subscript M will be 

omitted whenever the meaning remains clear). 

Let a = aia2 . . .an ■ ■ . be an co-word over E. an infinite sequence of configura- 
tions r — {qi,Ji)i>i is called a complete run of M on a, starting in configura- 
tion (p,7), iff: 

(1) (?i,7i) = {P,l) 

(2) for each i > 1, there exists 6^ G S U {A} satisfying bi : {qi^'^i) 
(gi+i, 7i+i) such that 0102 . . . a„ . . . = 6162 • • • 6n • • • 

As for FSM, for every such run, In{r) is the set of all states entered infinitely 
often during run r. 

A complete run r of M on a , starting in configuration {qo, Zq), will be simply 
called " a run of M on a ". 

Definition 2.5 A Biichi pushdown automaton (BPDA) is a 7-tuple M = 
{K, E, F, 6, qo, Zo, F) where M' = {K, E, F, 5, qo, Zo) is a PDM and F C K is 
the set of final states. 
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The u-language accepted by M is L{M) = {cr e E'^ | there exists a complete 
run r of M on a such that In{r) fl F 0}. 

Definition 2.6 A Muller pushdown automaton (MPDA) is a 7-tuple M — 
{K, E, r, 5, go, Zq, F) where M' = {K, E, T, 5, go, ^o) is a PDM and F <Z2^ is 

the collection of designated state sets. 

The u-language accepted by M is L{M) = {cr e E'^ | there exists a complete 
run r of M on a such that In{r) G F}. 

Remark 2.7 We consider here two acceptance conditions for uj-words, the 
Biichi and the Muller acceptance conditions, respectively denoted 2- acceptance 
and 3-acceptance in [Lan69] and in [CG78] and (m/, □) and {inf,=) in 
[Sta97a]. 

Cohen and Gold and independently Linna established a characterization The- 
orem for (j-CFL: 

Theorem 2.8 LetCFL he the class of conteoct free (finitary) languages. Then 
for any oo -language L the following three conditions are equivalent: 

(1) Leuj-KC{CFL). 

(2) There exists a BP DA that accepts L. 

(3) There exists a MPDA that accepts L. 

In [CG77] are also studied cu-languages generated by w-context free grammars 
and it is shown that each of the conditions 1), 2), and 3) of the above Theorem 
is also equivalent to: 4) L is generated by a context free grammar G by leftmost 
derivations. These grammars are also studied in [Niv77] [Niv78]. 
Then we can let the following definition: 

Definition 2.9 An ui-language is an oj- context free language (oo-CFL) (or 
context free ui-language ) iff it satisfies one of the conditions of the above 
Theorem. 



3 Topology 

We assume the reader to be familiar with basic notions of topology which may 
be found in [LT94] [PPOl] [Kur66] [Mos80] [Kec95]. 

Topology is an important tool for the study of a;-languages, and leads to 

characterization of several classes of cj-languages. 

For a finite alphabet X, we consider as a topological space with the Cantor 
topology. The open sets of X'^ are the sets in the form W.X^ , where W C X* . 
A set L C X'^ is a closed set iff its complement X'^ — L is an open set. The 
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class of open sets of X'^ will be denoted by G or by S^. The class of closed 
sets will be denoted by F or by II^. Define now the next classes of the Borel 
Hierarchy: 

Definition 3.1 The classes and 11^ of the Borel Hierarchy on the topo- 
logical space are defined as follows: 
S° is the class of open sets of X^ . 

is the class of closed sets of X"^ . 
Ha or Gs is the class of countable intersections of open sets of X'^ . 

or Fg. is the class of countable unions of closed sets of X'^ . 
And for any integer n>l: 

^n+i ^-^ class of countable unions of Il^-subsets of X'^ . 

nn+i ^■^ th^ class of countable intersections of 1^^- sub sets of X'^ . 

The Borel Hierarchy is also defined for transfinite levels. The classes S° and 

n^, for a countable ordinal a, are defined in the following way: 

5]° is the class of countable unions of subsets of X'^ in U,-<an°. 

n° is the class of countable intersections of subsets of X'^ in U^<q,S°. 

Recall some basic results about these classes, [Mos80]: 

Proposition 3.2 

(a) S° U n° C n n°_,_j^, for each countable ordinal a > 1. 

(b) U-y<Q.I]^ = U-y<Q,n° C S° n n°, for each countable limit ordinal a. 

(c) A set W C X^ is in the class S° iff its complement is in the class 11°. 

(d) S° — n° 7^ and 11° — S° ^ hold for every countable ordinal a>l. 

We shall say that a subset of X'^ is a Borel set of rank a, for a countable 
ordinal a, iff it is in S° U n° but not in U7<a(S° U n^). 

Furthermore, when X is a finite set, there are some subsets of X'^ which are 
not Borel sets. Indeed there exists another hierarchy beyond the Borel hierar- 
chy, which is called the projective hierarchy and which is obtained from the 
Borel hierarchy by successive applications of operations of projection and com- 
plementation. More precisely, a subset A of X^ is in the class Sj; of analytic 
sets iff there exists another finite set Y and a Borel subset B ol {X x YY such 
that x e A -H- 3y e such that (x, y) e B. 

Where {x, y) is the infinite word over the alphabet X xY such that {x, y){i) — 
{x{i),y{i)) for each integer i > 0. 

Now a subset of X'^ is in the class Il\ of coanalytic sets iff its complement 
in X^ is an analytic set. 

The next classes are defined in the same manner, S^^j^-sets of X^ are projec- 
tions of n^-sets and n^_|_j^-sets are the complements of S^^j^-sets. 

Recall also the notion of completeness with regard to reduction by continuous 
functions. 
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A set F C X'^ is a S° (respectively n°)-complete set iff for any set £" C 
{Y a finite alphabet): 

E e (respectively E e n|J) iff there exists a continuous function / from 

into X'^ such that E = /"^(F). 
A similar notion exists for classes of the projective hierarchy: in particular a 
set F C is a (respectively ni)-complete set iff for any set E QV^ {Y 
a finite alphabet): 

F e (respectively E e Ilj;) iff there exists a continuous function / from 
into X'^ such that E = f-\F). 

A S° (respectively 11°, S];)-complete set is a S° (respectively 11°, Sj;)- set 
which is in some sense a set of the highest topological complexity among the 
S° (respectively n°, E^)- sets. 



4 topological properties of a;-CFL 

Recall first previous results. cj-CFL exhaust the finite ranks of the Borel hier- 
archy. 

Theorem 4.1 ([FinOla]) For each integer n > 1, there exist some S°- 
complete u-CFL and some 11^- complete u-CFL. 

Cohen and Gold proved that one cannot decide whether an a;-CFL is in the 
class F, G or G^. We have extended in [FinOla] this result to all classes S° 
and n°, for n an integer > 1. (We say that an o'-CFL A is effectively given 
when a MPDA accepting A is given). 

Theorem 4.2 ([FinOla]) Let n he an integer > 1. Then it is undecidable 
whether an effectively given uj-CFL is in the class S° ( repectively 11° 

When considering a;-CFL, natural questions now arise: are all u-CFL Borel 
sets of finite rank, Borel sets, analytic sets....? First recall the following: 

Theorem 4.3 ([Sta97a]) Every u-CFL over a finite alphabet X is an ana- 
lytic subset of X'^. 

Proof, we just sketch the proof. 

Every o'-CFL ^4 C E'^ is the projection of a deterministic u-CFL onto E'^ but 
deterministic u-CFL are Borel sets of rank at most 3, and it is well known 
that such a projection of a Borel set is an analytic subset of E'*'. Remark that 
in fact each u-CFL is the projection of an u-CFL which is accepted by a 
deterministic Biichi pushdown automaton and therefore which is a Ilj-set. □ 

Remcirk 4.4 This above theorem is in fact true for u-languages accepted 
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by Turing machines which are much more powerful accepting devices than 
pushdown automata [Sta97a]. 

The following question now arises: are there a;-CFL which are analytic but 
not Borel sets? 

Theorem 4.5 There exist uj-CFL which are T,\-complete hence non Borel 
sets. 

Proof. We shall use here results about languages of infinite binary trees whose 
nodes are labelled in a finite alphabet E. 

A node of an infinite binary tree is represented by a finite word over the 
alphabet {/,r} where r means "right" and I means "left". Then an infinite 
binary tree whose nodes are labelled in E is identified with a function t : 
{/, r}* — > E. The set of infinite binary trees labelled in E will be denoted T^. 

There is a natural topology on this set [Mos80] [LT94][Sim92]. It is defined 
by the following distance. Let t and s be two distinct infinite trees in T^. Then 
the distance between t and s is 2^ where n is the smallest integer such that 

t{x) 7^ s{x) for some word x G {l,r}* of length n. 

The open sets arc then in the form Tq.T^ where Tq is a set of finite labelled 
trees. Tq.T^ is the set of infinite binary trees which extend some finite labelled 
binary tree to & Tq, to is here a sort of prefix, an "initial subtree" of a tree in 
to.T^. 

The Borel hierarchy and the projective hierarchy on are defined from open 
sets in the same manner as in the case of the topological space E'^. 

Let t be a tree. A branch i? of t is a subset of the set of nodes of t which is 
linearly ordered by the tree partial order C and which is closed under prefix 
relation, i.e. if x and y are nodes of t such that y E B and x C.y then x E B. 
A branch 5 of a tree is said to be maximal iff there is not any other branch 
of t which strictly contains B. 

Let t be an infinite binary tree in T^. If i? is a maximal branch of t, then this 
branch is infinite. Let {ui)i>o be the enumeration of the nodes in B which is 
strictly increasing for the prefix order. 

The infinite sequence of labels of the nodes of such a maximal branch B. i.e. 
t{uo)t{ui) . . . t{un) ... is called a path. It is an w-word over the alphabet E. 

Let then L C E'*' be an a;- language over E. Then we denote Path{L) the set 
of infinite trees t in such that t has (at least) one path in L. 

It is well known that if L C S*^ is an w-language over E which is a IIj- 
complete subset of E"^ (or a set of higher complexity in the Borel hierarchy) 
then the set Path{L) is a Sj;-complete subset of T^. Hence Path{L) is not a 
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Borel set, [Niw85] [Sim92] [Sim93]. 

Whenever B is an cj-CFL we shall find another w-CFL C and a continuous 
function 



such that Path{B) = h~^{C). For that we will code trees labelled in E by 
words over E U {A} — E^, where A is supposed to be a new letter not in E. 

Consider now the set {I, r}* of nodes of binary infinite trees. For each integer 
n > 0, call Cn the set of words of length n of {l,r}. Then Cq — {A}, Ci = 
{l,r}, C2 = {ll,lr,rl,rr} and so on. C„ is the set of nodes which appear in 
the (n + l)th level of an infinite binary tree. The number of nodes of C„ is 
card{Cn) = 2". We consider now the lexicographic order on C„ (assuming 
that / is before r for this order) . Then, in the enumeration of the nodes with 
regard to this order, the nodes of Ci will be: r; the nodes of C3 will be: 
III, llr, Irl, Irr, rll, rlr, rrl, rrr. 

Let be such an enumeration of C„ in the lexicographic 

order and let v^, . . . , v", . . . , vl^n be the enumeration of the elements of C„ in 
the reverse order. Then for all integers n > and i, 1 < i < 2", it holds that 

'^i — ^2" + l-i- 

We define now the code of a tree t in T^. Let A be a letter not in S. We 
construct an w-word over the alphabet (S U {A}) which will code the tree t. 
We enumerate all the labels of the nodes of a tree in the following manner: 
firstly the label of the node of Co which is t{u\), 

followed by an A, followed by the labels of nodes of Ci in the lexicographic 
order, i.e. t{u])t{u\), followed by an A, followed by the labels of the nodes of 
C2 in the reverse lexicographic order, followed by an A, followed by the labels 
of nodes of C3 in the lexicographic order, and so on . . . 

For each integer n > 0, the labels of the nodes of Cn are enumerated before 
those of Cn+i and these two sets of labels are separated by an A. Moreover the 
labels of the nodes of C2n+i, for n > 0, are enumerated in the lexicographic 
order (for the nodes) and the labels of the nodes of C2n, for n > 0, are 
enumerated in the reverse lexicographic order (for the nodes). 

Then for each tree t in T^, we obtain an a;- word of E U {^4} which will be 
denoted h{t). With the preceding notations it holds that: 



h{t) = t(M?)At(u})t(u^)At(w^)t(^;^)t(^;|)t(t;^)At(M5)^(MD^(«3)^(«4)^(«5)^(«6)^(«7)^(«8)^- ■ ■ 
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Let then h be the mapping from into (SU{A})'^ such that for every labelled 
binary infinite tree t of T^, h{t) is the code of the tree as defined above. It 
is easy to see, from the definition of h and of the order of the enumeration of 
labels of nodes, that /i is a continuous function from into (S U {^})'^. 

Assume now that B is an u-CFL accepted by a Biichi pushdown automa- 
ton M = {K,T,,T,6,qo, Zq, F) where M' = {K,T^,T,d,qQ, Zq) is a pushdown 
machine and F C K is the set of final states. 

Now we are looking for another w-CFL C such that for every tree t G T^, 
h{t) G C if and only if t has a path in B. Then we shall have Path{B) — 
h^\C). 

We shall give a first description of such an cu-CFL C by constructing from 
M another Biichi pushdown automaton M which accepts C . 
The reader can also skip this description and read a second description of 
the cu-CFL C which will be given below. 

Describe first informally the behaviour of the new machine M . When M reads 
a word in the form then using the non determinism it guesses a maximal 
branch of the tree t and simulates on this branch the Biichi pushdown au- 
tomaton M. Finally the acceptation of h{t) by M is related to the acceptation 
of the w-word formed by the labels of this branch by M. 

More formally M = {K,T,,T,d,qo, Zq, F), where 

K = KU{q^ I q G K}U{q^ \ q G K}U{q^ \ q G i^}U{g^ | q G K}U{q^ \ q G K}U{qr} 

E = EU{yl} 

where is a new letter not in F, 

go = go 

Zq — Zq 
F^FU{q^\qeF} 

and the transition relation S is defined by the following cases which will be 
explained below: 

(a) (g, 1/) G S{qo, a, Zq) iff {q, u) G S{qo, a, Zq), for each a G S and i' G F*. 

(b) S{qo,A,Zo) = {qr,Zo). 

(c) 6_{q, a, Z) = iq\ EZ), for each a G E, Z G F U {£;} and g G X. 

(d) 6{q^, a, E) = (q^, EE), for each a G E, and q E K. 

(e) 5lq\ A, Z) = {q\ Z), for each Z G F U {F} and g G K. 

(f) Slq, A, Z) = (g2, Z), for each Z G F U {F} and g G K. 

(g) (5(g^, a, E) = (g^, E), for each a G E, and q e K. 
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(h) S{q^, a, E) = {q^, A), for each a G E, and q e K. 

(i) A, E) = {qr, E), for each q e K. 
(j) 6{q^, A, E) = {qr, E), for each qe K. 

(k) Sjqr, a, Z) = {qr, Z), for each a e (E U {A}) and Z e T U {£;}. 
(1) 5{q^, a, Z) 3 {q', v) iff 5{q, a, Z) 3 {q' , v), for each a e T,, q,q' e K, Z e F, 
and G T*. 

(m) 5(^2^ A, Z) 3 z^) iff A, Z) 3 {q', u), for each q,q' e K, Z e T, and 

u e ^^ 

(n) A, Z) 3 i^) iff S{q, X, Z) 3 {q', p), for each q,q' e K, Z e T, and 

u e r*. 

(o) 5{q^, a, Z) 3 {q', v) iff 5{q, a, Z) 3 {q' , v), for each a e T,, q,q' e K, Z e F, 
and e r*. 

(p) Kq^i ^) 3 (q'^j 6ach a G E, Z G r and q E K. 

(q) a, Z) 9 (g^ Z), for each a G E, Z G T and g G 
(r) 5{q'^, a, Z) 3 {q' , v) iff 5{q, a, Z) 3 {q' , v), for each a eH, q,q' e K, Z eV, 
and V G F*. 

(s) 5{q^, A, Z) = (g^, Z), for each g G /sT, Z G T. 
We describe now more precisely the behaviour of M. 

To the set K of states of M, we add sets of states i^' = {g* | g G K} for each 
integer z G [1,5], and a state g^ which will be a rejecting state. 

We firstly consider only the reading by M of words in the form h{t) where 
t E T^. When M simulates M on the branch it guesses, it enters in a state of 
K, as indicated by (a), {I), (o), (r), or of if it uses a A-transition, i.e. if it 
does not read any letter during this transition, as indicated by (m) — (n). 

When M reads the labels of the nodes of t, it reads successively the labels of 
nodes of Cq, Ci, C2, . . . , Q, . . . 

Let B be the branch which is guessed by M during a reading. 

After the use of one transition rule of (a), (/), (o) or (r), reading the label of 
a node m of S in C„, n > 0, M enters in a state g\ keeping the memory of g, 
and then continues the reading of the (labels of) nodes of C„, pushing an E 
on the top of the stack for every letter of E it reads (transition rules (c), (d)) 
until it reads an A. Then it enters in state g^, keeping again the memory of g, 
(transition rules (e), (f)) and reading the labels of nodes of C„+i, it begins to 
pop an E from the top of the stack for two letters of E it reads, as indicated by 
transition rules {g), {h) (here are used the two sets of states K"^ and K^). Thus 
when the letter at the top of the stack is again a letter of F (and not an E) the 
machine M reads the label of one successor of the node u (this is due to the 
fact that the tree is binary and to the order of the enumeration of the nodes 
we have chosen in the definition of h{t)). It may choose to simulate M on 
this label, as indicated by the transition rules (/), (m), (n), (o), (perhaps after 
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some A-transitions). Otherwise it may choose to wait the next label, entering 
in state g^, as indicated by the transition rules (p), (q), and then simulates M 
as indicated by the transition rule (r). 

Some other transition rules, (b), (i), (j), (k), (s), lead to the rejecting state 
in which M remains for the rest of the reading. But in fact these transition 
rules are never used for the reading of w- words in the form h{t) where t & T^. 

Now we can see that when M simulates M on the branch S, if M enters in a 
state q & K, then M enters in the state q or in the state q^ (when a A-transition 
is used). Thus the choice of the set of accepting states F = F U {q^ \ q G F} 
implies the property: for a tree t e , h{t) e C if and only if t has a path in 
B. 

We are going now to give a second description of the w-CFL C. 

The w-language C which we have constructed from the a;-language B can 
easily be described by means of substitution of context free languages. 
Let first D be the following finitary language over the alphabet (E U {A}): 

D = {u.A.v I u,v eT,* and {\v\ = 2\u\) or {\v\ = 2\u\ + 1) } 

It is easy to see that D is a context free language. 

Now an a;-word a & C may be considered as an w-word a' & B to which we 
add, between two consecutive letters a'{n) and a'{n + 1) of a', a finite word 
Vn belonging to the context free finitary language D. 

Recall now the definition of substitution in languages: A substitution / is 
defined by a mapping E where E = {oi, . . . , a„} and F are two 

finite alphabets, f : ai ^ Li where \/i G [l;n], Lj is a finitary language over 
the alphabet F. 

Now this mapping is extended in the usual manner to finite words: 
f{x{l) . . .x{n)) ^ {ui. . .Un I Ui e f{x{i)), yi e [l;n]} 
where 2^(1), . . . , x{n) are letters in E, and to finitary languages L C E*: 

f{L) = U,eL/(x) 

The substitution / is called A-free if Vi e [1; n] does not contain the empty 
word. In that case the mapping / may be extended to a;-words: 

f{x{l)...x{n)...) = {ui...Un... I «j e /(a;(i)), Vi > 1} 

Let C be a family of languages, if Vi e [1; n] the language belongs to C the 
substitution / is called a C-substitution. 
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Let then g be the substitution E -> P((SU{A})*) defined by: a — )> a.D where 
D is the context free language defined above. Then g is a, A-fi^ee substitution 
and g{B) = C holds. But the languages a.D are context free and CFL^ is 
closed under A- free context free substitution [CG77]. Then B e CFL^^ impfies 
that C e CFL^. 

Hence if i? is a Borel set which is a Ilj-complete subset of S"^ (or a set of 
higher complexity in the Borel hierarchy), the language h~^{C) = Path{B) is 
a Sj;-complete subset of T^. Then the a;-language C is at least Sj-complete 
because h is a, continuous function (note that here /i is a continuous function: 
— >■ (Syi)'^ and the preceding definition of S];-complete set involves con- 
tinuous reductions: X'^ — )■ V^; but the two topological spaces and (S^)'^ 
have good similar properties which enable to extend the previous definition 
to this new case [Mos80][Kec95]). And C is in fact a Sj-complete subset of 
(E U {A})'*' because every w-CFL is an analytic set by Theorem 4.3. 
Then in that case C is not a Borel set because a Sj;-complete set is not a 
Borel set [Kur66][Mos80]. 

Indeed this gives infinitely many non Borel co-CFL , because there exist in- 
finitely many w-CFL of borel rank > 2. □ 

Remark that in the above proof, whenever B is an oj-regular language accepted 
by a Biichi automaton M, the resulting machine M is just a one counter 
machine, i.e. a pushdown machine having a stack alphabet T = {Zq, E}, where 
Zq is the bottom symbol which always remains at the bottom of the pushdown 
store and appears only there. Then at any moment of any computation the 
word in the pushdown store is in the form E^'Zq where n is an integer > 0. 
Thus it holds that: 

Corollciry 4.6 There exist one counter ui-languages which are T,\-complete 
hence non Borel sets. 

Now we can deduce from the preceding proof the following undecidability 
result: 

Theorem 4.7 Let E be an alphabet containing at least two letters. It is unde- 
cidahle, for an effectively given oo-CFL B to determine whether B is a Borel 
subset of E'^. 

Proof. Remark first that h{T^) is the set of w-words in (E^)'^ which belong 
to 

E.A.E\A.E\A.E\A...A.E^\AE''^^' . . . 

In other words this is the set of words in (E^)"^ which contain infinitely many 
occurrences of the letter A, and have 2" letters of E between the nth and the 
{n+l)th occurrences of the letter A. We shall first state the following: 
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Lemma 4.8 Let Tl be a finite alphabet. Then (S^)'^ — h{T^) is an omega 
context free language. 

Proof. Let 

A = (A U U E.A.A U E.A.E.A U E.A.E^).{Ea)'' 

Ai is the set of words in (S^)"^ which have not any word of 'E.A.H'^.A as prefix. 
Ai is clearly an w-regular language hence it is also an w-CFL. 

Let now Bi be the set of finite words over the alphabet which are in the 
form A.u.A.v.A where G S and \v\ < 2\u\. And let B2 be the set of finite 
words over the alphabet which are in the form A.u.A.v where w, f e E and 
\v\ > 2\u\. 

Then it is easy to see that Bi and B2 are context free finitary languages, thus 
the a;-language 

= [(E^)*.Bi.(E^)-] U [(Ea)*.S2.(E^)-] 

is an an omega context free language by Theorem 2.8. 

But (E^)'^ — h{T^) = AiU A2 and the class of context free w-languages is 
closed under union [CG77] therefore {T^aY ~ h{T^) is an omega context free 
language. □ 

We recall now a result established in [FinOla] in the course of the proof of the 
above Theorem 4.2. We had seen that: 

Lemma 4.9 There exists a family of (effectively given) context free u-languages 
{A'^ yY over the alphabet {a, 6, c, d} such that {A'^ yY ^-^ either {a, 6, c, ^ 
, d}'^ or an uj-language which is a Borel set but neither a Yl^-subset nor a 
H%-subset of {a,b,c,*^,d}'^. But one cannot decide which case holds. 

Consider now these languages. Denote B{X, Y) — {AxyY ^ = ^1 ^1 
,d}. 

Then there are two cases. 

In the first case B{X, F) = E'^. 

In the second case B{X, Y) is neither a Lla-subset nor a Sa-subset of E'*'. 

Return now to the previous proof. 

In the first case Path{B{X, Y)) = Path{T.'^) = T^. 

In the second case Path{B{X,Y)) is a S^-complete subset of T^. 

Construct now from B[X,Y) another omega context free language C{X,Y) 
over the alphabet E^ in the same manner as we have constructed C from B 
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in the above proof. 



Let then D{X, Y) = C{X, Y) U [{J^aT - h{T^)]- D{X, Y) is an cj-CFL because 
it is the union of two w-CFL and the class of omega context free languages is 
closed under union. 

Then two cases may happen. 

In the first case, Path{B{X, Y)) = hence h{T^) C C{X, Y) and D{X, Y) = 
(Ea)'^. Therefore D{X,Y) is a closed and open subset of (E^)'^. 

In the second case h-\D{X,Y)) = h-\C{X,Y)) = Path{B{X,Y)) holds by 
construction and then D{X, Y) is a S];-complete subset of (E^)"^, for the same 
reason as C{X, Y) is Sj;-complete . 

But one cannot decide which case holds hence one cannot decide whether the 
context free a;- language is a Borel set. 

To see that the result is also true for an alphabet containing two letters, 
consider the morphism g : {a, 6, c, d, A}* {a, b}* defined by: a — > bab, 

b ba^b, c -> ba^b, {^) ba%, d ba^b, A ba^b. 

This morphism is A-free and may be extended to infinite words in an obvious 
manner, giving a continuous function g : {a, b, c, «-, d, A}'^ — )■ {a, b}"^. 

Let then F{X,Y) = g{D{X,Y)). 

F{X, Y) is an u-CFL because D{X, Y) is an u-CFL and the class of context 
free a;-languages is closed under A-free morphism [CG77]. 
There are again two cases. 

In the first case, D(X,Y) — (E^)'*', hence D{X,Y) is a compact set and, 
the image of a compact set by a continuous function being a compact set, 
F(X, Y) = g{D{X, Y)) is a compact subset of {a, b}^ , therefore it is a closed 
subset of {a, b}^ . 

In the second case, D{X,Y) = g-^{F{X,Y)) and D{X,Y) is a S{-complete 
subset of , thus F(X, Y) is also at least a Sj;-complete subset of {a, fe}"^, and 
in fact it is a S];-complete subset because it is an analytic set as an w-CFL. 

□ 

Remark that we have also extended Theorem 4.2 to all Borel classes: 

Theorem 4.10 Let a he a countable ordinal > 1. Then it is undecidable to 
determine whether an effectively given ui-CFL is in the class S° ( respectively 

Proof. The result has been proved for every finite ordinal (integer) > 1 in 
[FinOla]. Let then o; be a countable infinite ordinal. The above defined cj-CFL 
F{X, Y) is either a Il^-subset or a Sj;-complete subset of {a, b}'^. In the first 
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case it is in the class S° ( respectively 11°) and in the second case it is not a 
Borel set. But one cannot decide which case holds. □ 



5 a;-powers of flnitairy languages 

We study in this section w-powers of finitary languages, i.e. w-languages in 
the form where F is a finitary language, cu-powers of finitary languages 
are always analytic sets because whenever V is finite, is an a;-regular 
language and then it is a boolean combination of S2-scts and whenever V 
is countably infinite, one can fix an enumeration of V and obtain as a 
continuous image of oj'^ (the set of infinite sequences of integers > 0), [Sim92]. 

Niwinski asked in [Niw90] for an example of finitary language W such that 

is an analytic but non Borel set. 
From the results of preceding section, we can easily find an example of a 
context free language W such that is not a Borel set. 

Consider the construction of the a;- language C from the cu-language B CI]^ 
in the proof of Theorem 4.5. As stated above, if g is the substitution E — >■ 
P((E U {A})*) defined by a ^- a.D where 

D ^ {u.A.v I u,v eE* and {\v\ = 2\u\) or {\v\ = 2\u\ + 1) } 

then D is a context free language over the alphabet (E U {A}) and g{B) — C 
holds. 

Assume now that B is an a;-power in the form . Then g{B) — {g(y)Y is 
also an cu-power. 

Let then E = {0, 1} be an alphabet containing two letters and 1 and W = 
0*.l. Then — (0*.!)'^ is the set of a;-words over the alphabet E which 
contain infinitely many occcurrences of the letter 1. It is a well known example 
of an w-regular language which is a Ila-complete subset of E'^. 

Thus the language g{W) is a finitary context free language such that {g{W))'^ 
is an analytic but non Borel set. 

This language g{W) is in fact a one counter language. 

This gives an answer to Niwinski's question and additional answer to questions 
of Simonnet who asked in [Sim92] for the topological complexity of the ui- 
powers of context free languages. 
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6 Arithmetical properties 

We are going to deduce from the previous proofs some new results about 
w-context free languages and the Arithmetical hierarchy. We recall first the 
definition of the Arithmetical hierarchy of a;- languages, [Sta97a]. 

Let X be a finite alphabet. An a;- language L C X"^ belongs to the class E„ 
if and only if there exists a recursive relation Rl C (N)"~^ x X* such that 

L = {(7 e X"^ I 3ai...Qnan (ai, . . . , On-i, cr[an + 1]) G i?L} 

where Qi is one of the quantifiers V or 3 (not necessarily in an alternating 
order). An w-language L C X'^ belongs to the class Iln if and only if its 
complement X'^ — L belongs to the class E„. 

The inclusion relations that hold between the classes E„ and n„ are the same 
as for the corresponding classes of the Borel hierarchy. 

Proposition 6.1 ([Sta97a]) a) S„ U II^i C E^+i fl n„+i, for each integer 
n > 1. 

h) A set W <^ X'^ is in the class E„ if and only if its complement W" is in 
the class Yin- 

c) E„ — n„ 7^ and n„ — E„ 7^ hold for each integer n > 1 . 

The classes E„ and n„ are strictly included in the respective classes and 
of the Borel hierarchy: 

Theorem 6.2 ([Sta97a]) For each integer n > 1, E„ C and n„ C n°. 

Recall now preceding results of [FinOla]: 

Theorem 6.3 Let n he an integer > 1. Then it is undecidahle whether an 
effectively given ui-CFL is in the class E„ ( respectively n„j. 

As in the case of the Borel hierarchy, projections of arithmetical sets (of the 
second Il-class) lead beyond the Arithmetical hierarchy, to the Analytical 
hierarchy of a;- languages. The first class of this hierarchy is the class Ti\. 
An a;- language L C X^ belongs to the class Ej if and only if there exists a 
recursive relation C (N) x {0, 1}* x X* such that: 

L = {(7 e X'^ I 3t{t e {0, 1}'^ A Vn3m((n, T[m],a[m]) e Rl))} 

Then an w-language L C X'^ is in the class T\ iff it is the projection of 
an w-language over the alphabet X x {0, 1} which is in the class 112 of the 
arithmetical hierarchy. 
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It turned out that an a;- language L C X'^ is in the class T\ iff it is accepted 
by a non deterministic Turing machine (reading a;-words) with a Muller ac- 
ceptance condition [Sta97a]. This class is denoted NT{inf, =) (where (m/, =) 
indicates the Muller condition) in [Sta97a] and also called the class of recursive 
w-languages REK^. 

With the above definitions, one can state the following: 

Theorem 6.4 ([Sta97a]) The class CFL^^ is strictly included into the class 
REK^ of recursive uj-languages. 

A natural question arises: are there cj-CFL which are in the class T\ but in 
not any class of the arithmetical hierarchy? The answer can be easily derived 
from the preceding corresponding results about the Borel Hierarchy. 

Theorem 6.5 There exist some context free u-languages in Sj — Un>i ^n- 

Proof. It follows from Theorems 4.5 and 6.2. □ 

We now obtain a recursive analogue to Theorem 4.7: 

Theorem 6.6 Let S he an alphabet containing at least two letters. It is un- 
decidable, for an effectively given context free u-language B to determine 
whether B is in T\ — Un>i ^n- 

Proof. Recall that we had found (see proof of Theorem 4.7) a family of context 
free w-languages D{X,Y) over the alphabet F = {a, 6, c, «-, c?, ^4} such that 
D{X, Y) is either a Sj;-complete subset of F"^, or equal to F'^. 

Whenever D{X, Y) is S];-complete, it is not in Un>i because each arith- 
metical class S„ (respectively H^) is included in the Borel class (respec- 
tively n«). 

Whenever D{X, Y) is equal to F"^, D{X, Y) is in the class Si because of the 
characterization of oj-languages in Si [Sta97a]: an oj-language L C X'^ 
belongs to the class Si if and only if there exists a recursive finitary language 
W (ZX* such that L = W.X"^. 

But we had proved that one cannot decide which of these two cases holds, 
hence the result is proved for the alphabet F. (And we can use similar methods 
as in the proof of Theorem 4.7 to obtain the result for an alphabet of cardinal 
> 2). □ 

Considering Turing machines, we get the following: 

^ In another presentation, as in [Rog67], the recursive cj-languages are those which 
are in the intersection Si n Hi, see also [LT94]. 



19 



Theorem 6.7 It is undecidable to determine whether the complement of an 
effectively given u-CFL is accepted by a non deterministic Turing machine 
with Biichi (respectively Muller) acceptance condition. 

Proof. As in the preceding proof consider the family of context free uj- 
languages D{X, Y) over the alphabet V = {a, b, c, d, A} such that D{X, Y) 
is either a Sj;-complete subset of F^, or equal to F'^. 

Whenever Y) is Sj-complete, its complement is Ili-complete thus it is 

not a Sj; set (because a set which is both I] J and 11 J is a Borel set) and 
therefore it is not a Sj-set (because the class is included in the class S];) 
then it is not accepted by any Turing machine with Biichi (respectively Muller) 
acceptance condition. 

In the other case D{X, Y) is equal to F'^, then its complement is the empty- 
set and it is accepted by a Turing machine with Biichi (respectively Muller) 
acceptance condition. 

But we had proved that one cannot decide which of these two cases holds, 
hence the result is proved for the alphabet F. (And we can use similar methods 
as in the proof of Theorem 4.7 to obtain the result for an alphabet of cardinal 
> 2). □ 

In fact this result can be extended to other deterministic machines. Consider 
X-automata as defined in [EH93] which are automata equipped with a storage 
type X. 

Theorem 6.8 Let X. be a storage type as defined in [EH93]. Then it is unde- 
cidable to determine whether the complement of an effectively given u-CFL is 
accepted by a non deterministic ^-automaton with Biichi (respectively Muller) 
acceptance condition. 

Proof. It is similar to the previous one because every X-automaton is less 
expressive than a Turing machine hence it cannot accept any nj;-complete 
set. And conversely F"^ is accepted by every X-automaton. □ 



7 Context free languages of infinite trees 

The theory of automata reading infinite words have been extended to au- 
tomata reading infinite binary trees labelled in a finite alphabet, i.e. trees in 
a space where S is a finite alphabet (and one may also consider infinite 
A;-ary trees labelled in S but we shall restrict ourselves here to binary trees), 
see [Tho90][Tho96][LT94][Sim92] for many resuhs and references. 
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It is known that regular languages of infinite binary trees exhaust the hierarchy 
of Borel sets of finite rank as shown by Skurczynski [Sku93]. Niwinski proved 
that there exist some regular set of trees which are non Borel sets, [Niw85]. 

Some regular sets of trees are S];-complete, as Path{B) where B is any 112- 
complete regular subset of E'^. Path{B) (defined in the proof of Theorem 4.5) 
is accepted by a non deterministic tree automaton which guesses a branch of 
a tree (using the non determinism) and then simulates a finite automaton on 
the path associated with this branch. 

One can also define, for each a;-language 5 C E'^, the following sets of trees. 
Let 

V - Path{B) 

be the set of trees t in such that every path of t is in 5, and let 

Left - Path{B) 

be the set of trees t in such that the leftmost path of t is in S (the nodes 
of the leftmost branch are the words of {i, r}* which are in the form P for an 
integer n > 0). 

It is then well-known that whenever S C E'^ is an w-regular language , the 
sets V — Path{B) and Left — Path{B) are regular sets of trees. Then if B is 
a Ilj-complete subset of E'^ it holds that: 

V - Path{B-) = T^- {Path{B)) 

hence V — Path{B~) is a nj;-complete subset of T^. 

The Theorem of complementation of Rabin implies that every regular set of 
trees is in S2 fl 112, ^^^^ it has been shown that there exist regular sets of trees 
which are not in U see [LT94] for a view of a hierarchy of regular sets 
of trees. 

As finite automata have been extended to (top-down) automata on infinite 
trees, pushdown automata have been extended to (top-down) pushdown au- 
tomata on infinite trees by Saoudi [Sao92]. Denote, as in [Sao92], CF3 the 
family of languages of infinite (binary) trees accepted by (top-down) push- 
down automata with MuUer acceptance condition. 

It is easy to see from the definition of these automata that, as in the case of 
tree automata, if B is an w-CFL, then the sets of trees Path{B) and Left — 
Path{B) are accepted by tree pushdown automata. Then we can extend our 
preceding undecidability results of Theorems 4.7 and 4.10. 

Theorem 7.1 (a) Let a be a countable ordinal > 1. Then it is undecidable 
to determine whether an effectively given language in CF^ is in the Borel 
class S° ( respectively 11°^. 
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(b) It is undecidable to determine whether an effectively given language in 
CF^ is a Borel set. 

(c) It is undecidable to determine whether an effectively given language in 
CF-i is in the class Yl\ . 

(d) It is undecidable to determine whether an effectively given language in 
CF^ is a but non Borel set. 

Proof. The proofs are easily derived from the proof of Theorem 4.7. Recall we 
had got a family of omega context free languages D{X, Y) over the alphabet 
such that: either D{X,Y) = (E^)'^, or D{X,Y) is a Sj-complete subset 

of (S^)'^. But one cannot decide which case holds. 

It is easy to see that Left — Path{D[X,Y)) has the same topological com- 
plexity as the w-language D{X,Y). 

Indeed let / be the function: (S^)'^ -)■ T^^ defined by /(ex) = ta where 
tcr is the tree in T^^ with a as leftmost path and the letter A labelling 
the other nodes. Then / is continuous and f^^{Left — Path{D{X,Y))) = 
D{X,Y). Assume first that D{X,Y) is a S];-complete subset of (11^)'^, then 
Left — Path{D[X, Y)) is also at least Sj-complete and not a Borel set. 
Now let j be the function T^^ (^a)'^ defined by: j{t) is the leftmost 
path of the tree t. Then j is a continuous function and j~^{D{X,Y)) ~ 
Left-Path{D{X, Y)). Hence when D{X, Y) is a S^-complete subset of (S^)'^, 
Left — Path{D{X, Y)) is a Sj;-sct because the class Sj; is closed under inverse 
of continuous functions. Thus Left — Path{D{X, Y)) is a S{-complete subset 
of T^^ and not a IlJ-set. 

In the other case D{X,Y) = (E^)'^ and Left - Path{D{X,Y)) = T^^ then 
Left — Path{D{X,Y)) is in every Borel class and also in the class Ilj;. But 
one cannot decide which case holds. This proves (a), (6), (c) and {d). □ 



8 Concluding remarks and further work 

We have proved in [FinOla] that the class of a;-CFL exhausts the finite ranks 
of the Borel hierarchy and in this paper (Theorem 4.5) that there exist some 
analytic but non Borel ui-CFL. 

The question to know whether there exist some a;-CFL which are Borel sets 
of infinite rank is still open. 

There exists a refinement of the Borel hierarchy which is called the Wadge 
hierarchy of Borel sets. We proved in [FinOlb] that the length of the Wadge 
hierarchy of w-CFL is an ordinal greater than or equal to the Cantor ordinal 
£q. And it remains to find the exact length of the Wadge hierarchy of Borel 
a;-CFL. 

Mention that on the other side, the Wadge hierarchy of deterministic a;-CFL 
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has been determined, its length is the ordinal uj^'^'^\ It has been recently studied 
in [DFROl] [Dup99] [Fin99]. 

Acknowledgments. We have previously proved the existence of analytic but 
non Borel sets in another class of w-languages, the class of locally finite uo- 
languages [FinOlc]. We are indebted to Jean-Pierre Ressayre who suggested 
the way to adapt the original proof to the context free case. 



References 



[Bar92] R. Barua, The Hausdorff-Kuratowski Hierarchy of w-Regular Languages 
and a Hierarchy of Muller Automata, Theoretical Computer Science 96 (1992), 
345-360. 

[Ber79] J. Berstel, Transductions and Context Free Languages, Teubner 
Studienbiicher Informatik, 1979. 

[BN80] L. Boasson and M. Nivat, Adherences of Languages, J. Comput. System Sci. 
20 (1980) 3, 285-309. 

[BiicGOa] J.R. Biichi, Weak Second Order Arithmetic and Finite Automata, 
Zeitschrift fur Mathematische Logik und Grundlagen der Mathematik, 6 (1960), 
pp 66-92. 

[BiicGOb] J.R. Biichi, On a Decision Method in Restricted Second Order Arithmetic, 
Logic Methodology and Philosophy of Science, ( Proc. 1960 Int. Congr. ) 
Stanford University Press, 1962, 1-11. 

[BS73] J.R. Biichi, D. Siefkes, The Monadic Second Order Theory of All Countable 
Ordinals, Decidable Theories 2, 1973, S.L.N.M. , number 328. 

[CG77] R. S. Cohen and A. Y. Gold, Theory of w-Languages, Parts one and two, J. 
Computer and System Science 15 (1977) 2, 169-184 and 185-208. 

[CG78] R. S. Cohen and A. Y. Gold, Computations on Deterministic Pushdown 
Machines, J. Computer and System Science (1978) 3, 257-300. 

[CP97] O. Carton and D. Perrin, Chains and Superchains for a;-Rational Sets, 
Automata and Semigroups, International Journal of Algebra and Computation 
Vol. 7, No 7(1997) p. 673-695. 

[CP99] O. Carton and D. Perrin, The Wagner Hierarchy of Ct-'-Rational Sets, 
International Journal of Algebra and Computation Vol. 9, No 5 (1999) p. 597- 
620. 

[Dup95a] J. Duparc, La Forme Normale des Boreliens de Rang Fini, Ph.D. Thesis, 
Universite Paris 7, 1995. 

[Dup95b] J. Duparc, The Normal form of Borel sets. Part 1: Borel Sets of Finite 
Rank, C.R.A.S. Paris, t.320, Serie 1, p.651-656, 1995. 



23 



[DupOl] J. Duparc, Wadge Hierarchy and Veblen Hierarchy: part 1: Borel Sets of 
Finite Rank, Journal of SymboUc Logic, 66 (2001), no. 1, p. 56-86. 

[DFROl] J. Duparc, O. Finkcl and J-P. Ressayre, Computer Science and the Fine 
Structure of Borel Sets, Theoretical Computer Science, Volume 257 (1-2), April 
2001, p.85-105. 

[Dup99] J. Duparc, A Hierarchy of Context Free Omega Languages, Theoretical 
Computer Science, to appear. 

[Eil74] S. Eilenberg, Automata, Languages and Machines, Vol. A, Academic Press, 
New York, 1974. 

[EH93] J. Engelfriet and H. J. Hoogeboom, X-Automata on w- Words, Theoretical 
Computer Science 110 (1993) 1, 1-51. 

[Fin99] O. Finkel, Wadge Hierarchy of Deterministic Omega Context Free 
Languages, in preparation. 

[FinOla] O. Finkel, Topological Properties of Omega Context Free Languages, 
Theoretical Computer Science, Volume 262 (1-2), July 2001, p. 669-697. 

[FinOlb] O. Finkel, Wadge Hierarchy of Omega Context Free Languages, Theoretical 
Computer Science, Volume 269 (1-2), October 2001, p. 283-315. 

[FinOlc] O. Finkel, Topological Complexity of Locally Finite Omega Languages, 
submitted to Annals of Pure and Applied Logic. 

[Gin66] S. Ginsburg, The Mathematical Theory of Context Free Languages, Mc 
Graw-Hill Book Company, New York, 1966. 

[HU69] J.E. Hopcroft and J.D. Ullman, Formal Languages and their Relation to 
Automata, Addison-Wesley Publishing Company, Reading, Massachussetts, 
1969. 

[Kam85] M. Kaminsky, A Classification of w-Regular Languages, Theoretical 
Computer Science 36 (1985), 217-229. 

[Kec95] A.S. Kechris, Classical Descriptive Set Theory, Springer- Verlag, 1995. 

[Kur66] K. Kuratowski, Topology, Academic Press, New York 1966. 

[Lan69] L. H. Landweber, Decision Problems for w-Automata, Math. Syst. Theory 
3 (1969) 4,376-384. 

[LT94] H. Lcscow and W. Thomas, Logical Specifications of Infinite Computations, 
In:" A Decade of Concurrency" (J. W. de Bakker et al., eds). Springer LNCS 
803 (1994), 583-621. 

[Lin76] M. Linna, On w-Sets Associated with Context-Free Languages, Inform. 
Control 31 (1976) 3, 272-293. 

[Lin77] M. Linna, A Decidability Result for Deterministic w- Context-Free 
Languages, Theoretical Computer Science 4 (1977), 83-98. 



24 



[Mos80] Y. N. Moschovakis, Descriptive Set Theory, North-Holland, Amsterdam 
1980. 

[Niv77] M. Nivat, Mots Infinis Engendres par une Grammaire Algebrique, RAIRO 
Infer. Theor. 11 (1977), 311-327. 

[Niv78] M. Nivat, Sur les Ensembles de Mots Infinis Engendres par une Grammaire 
Algebrique, RAIRO Infer. Theor. 12 (1978), 259-278. 

[Niw85] D. Niwinski, An Example of Non Borel Set of Infinite Trees Recognizable 
by a Rabin Automaton, in Polish, Manuscript, University of Warsaw, 1985. 

[Niw90] D. Niwinski, Problem on cj-Powers Posed in the Proceedings of the 1990 
Workshop "Logics and Recognizable Sets" (Univ. Kiel). 

[PPOl] D. Perrin and J.-E. Pin, Infinite Words, Book in preparation, available from 
|http : / / www . liaf a. j ussieu.fr /j ep /Infinit eWor ds . html 

[MaN66] R. Mac Naughton, Testing and Generating Infinite Sequences by a Finite 
Automaton, Information and Control 9 (1966), 521-530. 

[Rog67] H. Rogers, Theory of Recursive Functions and Effective Computability, 
McGraw-Hih, New York, 1967. 

[Sao92] A. Saoudi, Pushdown Automata On Infinite Trees and Nondeterministic 
Context-Free Programs, International Journal of Foundations of Computer 
Science Vol.3, Number 1, (1992) 21-39. 

[Sel98] V. Selivanov, Fine Hierarchy of Regular w-Languages, Theoretical Computer 
Science 191(1998) p.37-59. 

[Sim92] P. Simonnet, Automates et Theorie Descriptive, Ph. D. Thesis, Universite 
Paris 7, March 1992. 

[Sim93] P. Simonnet, Automate d' Arbres Infinis et Choix Borelien, C.R.A.S. Paris, 
t.316, Serie 1, p. 97-100, 1993. 

[Sku93] Jerzy Skurczynski, the Borel Hierarchy is Infinite in the Class of Regular 
Sets of Trees, Theoretical Computer Science 112, (1993) 413-418. 

[Sta97a] L. Staiger, w-Languages, Chapter of the Handbook of Formal Languages, 
Vol 3, edited by G. Rozenberg and A. Salomaa, Springer- Verlag, Berlin, 1997. 

[Sta97b] L. Staiger, On u-Powei Languages, in New Trends in Formal Languages, 
Control, Coperation, and Combinatorics, Lecture Notes in Computer Science 
1218, Springer- Verlag, Berlin 1997, 377-393. 

[Tho90] W. Thomas, Automata on Infinite Objects, in: J. Van Leeuwen, ed.. 
Handbook of Theoretical Computer Science, Vol. B ( Elsevier, Amsterdam, 
1990 ), p. 133-191. 

[Tho95] W. Thomas, On the Synthesis of Strategies in Infinite Games, in STACS'95, 
Volume 900 of LNCS, p. 1-13, 1995. 



25 



[Tho96] W. Thomas, Languages, Automata and Logic, in Handbook of Formal 
Languages Theory, Vol 3, Edited by G. Rozenberg and A. Salomaa, Springer 
Verlag, 1996. 

[Wad84] W.W. Wadge, Ph. D. Thesis, Berkeley, 1984. 

[Wag79] K. Wagner, On Omega Regular Sets, Inform, and Control 43 (1979) p. 
123-177. 



26 



